Information geometry, dynamics and discrete quantum mechanics 



Marcel Reginatto 
Physikalisch- Technische Bundesanstalt, 
Bundesallee 100, 38116 Braunschweig, Germany 

Michael J. W. Hall 

Centre for Quantum Dynamics, 
Griffith University, Brisbane, 
QLD ^iii, Australia 



Presented at MaxEnt 2012, the 32nd International Workshop on Bayesian Inference and Maximum En- 
tropy Methods in Science and Engineering, July 15-20, 2012, Garching near Munich, Germany. 



1 



Abstract 

We consider a system with a discrete configuration space. We show that the geometrical struc- 
tures associated with such a system provide the tools necessary for a reconstruction of discrete 
quantum mechanics once dynamics is brought into the picture. We do this in three steps. Our 
starting point is information geometry, the natural geometry of the space of probability distribu- 
tions. Dynamics requires additional structure. To evolve the P'', we introduce coordinates S'^ 
canonically conjugate to the P'' and a symplectic structure. We then seek to extend the metric 
structure of information geometry, to define a geometry over the full space of the P*^ and S''. 
Consistency between the metric tensor and the symplectic form forces us to introduce a Kdhler 
geometry. The construction has notable features. A complex structure is obtained in a natural 
way. The canonical coordinates = \fP^e}^^ of the Kahler space are precisely the wave functions 
of quantum mechanics. The full group of unitary transformations is obtained. Finally, one may 
associate a Hilbert space with the Kahler space, which leads to the standard version of quantum 
theory. We also show that the metric that we derive here using purely geometrical arguments 
is precisely the one that leads to Wootters' expression for the statistical distance for quantum 
systems. 

PACS numbers: 03.65.Ta, 02.40.Tt, 02.40.Yy 
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I. INTRODUCTION 



In this paper, we consider systems with a discrete configuration space. In the presence of 
uncertainty, the state of a classical system will be described by a probability P — (P^, P"), 
where n is the number of available states. An analysis complementing the one carried out 
in Ref. [1], for continuous systems, leads to the remarkable result that the geometrical 
structures associated with such a system provide the tools necessary for a reconstruction of 
discrete quantum mechanics once dynamics is brought into the picture. The reformulation 
of Ref. [1] presented here is nontrivial: it requires new geometrical insights because assump- 
tions that are natural in the infinite dimensional case (e.g., "spatial locality," the action of 
the Galilean group, etc.) are no longer available in the discrete case. 

This reconstruction of discrete quantum mechanics has notable features. We show that 
the natural geometry of the space of probabilities in motion is a Kahler geometry. The 
canonical coordinates ijj^ = y/P^e^^^ of the Kahler space are precisely the wave functions 
of quantum mechanics. The full group of unitary transformations is obtained. Finally, 
one may associate a Hilbert space with the Kahler space, which leads to the standard 
version of quantum theory. We note that the Kahler space metric that we derive here using 
purely geometrical arguments is precisely the one that leads to Wootters' expression for the 
statistical distance for quantum systems [2], which he derived using a completely different 
argument based on the concept of distinguishability. Comparison of our approach to papers 
of Mehrafarin [3] and Goyal [4, 5], which also start from classical information geometry, will 
be made elsewhere. 



II. INFORMATION GEOMETRY 



The starting point is a classical system which has associated with it n different states. 
The probability that the system is in state i is given by P*, i = l,...,n, with P* > and 
P* = 1. There is a natural line element given by 

ds^ = Gij dP' dP^ = — % dP' dP^ (1) 

where a is a constant. The value of this constant can not be determined a priori; it is 
usually set to |. We do not make this assumption here but instead allow a to be a free 
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parameter. The metric Gij is known as the information metric, 

Gi 



2pi 



(2) 



The hne element of Eq. (1) leads to a concept of distance on a probability space. This 
distance seems to have been introduced into statistics by Bhattacharyya [6, 7] as a way of 
providing a measure of divergence for multinomial probabihties [8]. Wootters calls it the 
statistical distance [2]. To derive the statistical distance from the metric Gij, consider two 
points in probability space, Pa and Pb, joined by a curve -P*(i), < i < 1, and write the 
expression for the length / of the curve in the form 



■/o 



dt\l Gij 



dP\t) dPo{t) 



(3) 



dt dt 

The statistical distance is defined as the shortest distance between Pa and Pg. To compute 

the statistical distance, it is convenient to do the change of coordinates = y/P^. Then 



dt, 



\ 1=1 L 



dXi{t) 
dt 



(4) 



Since the curve P(t) is assumed to lie in the probability space, it must satisfy the condition 
J27=iP\t) = J27=i[^\'t)T — Ij th^t curve must lie on a unit n-dimensional sphere 

in the X space. The shortest distance on the n-dimensional sphere is equal to the angle 
between the unit vectors Xa and Xb- This leads immediately to 



d{PA, Pb) = cos-i = cos'^ \il\fPi4Ph 



(5) 



u=l 



which agrees with the expressions in the papers of Bhattacharyya and Wootters provided a 
is set to the standard value of \. 

The statistical distance does not play a fundamental role in our discussion. Nevertheless, 
we make reference to it here because it clarifies the relation of our work to that of Wootters. 



III. DYNAMICS, SYMPLECTIC GEOMETRY, AND OBSERVABLES 

We now set the probabilities in motion. Wc assume that the dynamics of P* are generated 
by an action principle and we introduce additional coordinates S'^ which are canonically 
conjugate to the P* and a corresponding Poisson bracket for any two functions P(P, S) and 
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GiP,S), 

dF dG dF dC 



dP'dS' dS'dP'J' 
As is well known, the Poisson bracket can be rewritten geometrically as 



(6) 



{F, G} = {dF/dP , dF/dS) Q I ^^/^^ ^ (7) 

dG/dS J 



where Q is the corresponding symplectic form, given in this case by 



Q = f " - I , (8) 




where 1 is the unit matrix in n dimensions. We thus have a symplectic structure and a 
corresponding symplectic geometry. The equations of motion for P* and iS* arc given by 

= {P\ H}, 5"* = {S\ H} where H is the Hamiltonian that generates time translations. 

The observables of the theory are functions A{P, S) of the coordinates P* and S'^. Certain 
restrictions are imposed on them, so not every function is an observable. For example, the 
infinitesimal canonical transformation generated by any observable A must preserve the 
normalization and positivity of P. This implies the two conditions [9] 

A{P,S + c) = A{P,S), dA/dS' ^Oif P' ^0. (9) 

Note that the first condition implies gauge invariance of the theory under ^ S'^ + x- 



IV. KAHLER GEOMETRY 



We now want to consider the following question: Can we extend the metric Gij in Eq. (2), 
which is only defined on the n-dimensional subspace of probabilities P*, to the full 2n- 
dimensional phase space of the P* and 5**? It can be done, but certain conditions which 
ensure the compatibility of the metric and symplectic structures have to be satisfied. These 
conditions are equivalent to requiring that the space have a Kahler structure (see the Ap- 
pendix of Ref. [1] for a proof). We are led then to the beautiful result that the natural 
geometry of the space of probabilities in motion is a Kdhler geometry. 

A Kahler structure brings together metric, symplectic and complex structures in a har- 
monious way. To define such a space, introduce a complex structure J"^ and impose the 
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following conditions [10], 



^ab 9acJ b i 

J^gabJ^d = 9cd , 

ja jb _ _ra 

-J b-J c — " c ■ 



(10) 

(11) 

(12) 



Eq. (10) is a compatibility equation between the symplectic structure flab and the metric 
Qab, Eq. (11) is the condition that the metric should be Hermitian, and Eq. (12) is the 
condition that J"^ should be a complex structure. 

We derive the solutions to these equations. The metric over the subspace of probabilities 
is the information metric, Eq. (2). The metric over the full space will take the form 

G E 
F 



gab 



(13) 



where G = diag(2pi), and E and F are n x n matrices that need to be determined using 
the Kahler conditions and the expression for flab, Eq. (8). A matrix calculation leads to 
general forms for the metric gab and the complex structure J"^. These are 



9ab — 



ja _ 



V 



-G -GAG"^ 



(14) 



A (1 + A')G-^ 
-G -GAG"^ 

where the n x n matrix A satisfies GAG~^ = A^ but is otherwise arbitrary. 

The Kahler conditions restrict the form of the metric gab but leave the matrix A (and 
therefore the geometry) undetermined. To fix the geometry, it is necessary to introduce an 
additional condition. As wc show in the Appendix, there is a natural requirement based on 
the idea of invariance of the metric under the motions generated by the observables of the 
theory. This condition leads to A = and, as shown below, to a very simple geometry. The 
condition A = alternatively follows by assuming the extended metric has zero curvature, 
analogously to the classical information metric Gij. 



V. COMPLEX COORDINATES 



We set A = and consider the Kahler structure given by 





' 1^ 




\ 




^ G-i^ 


^ab — 




9ab = 










,-1 oj' 




G-'J 







(15) 



We now carry out the Madelung transformation, t/j^ — y/P^e:K.p{iS^/a), ■0' = 
y/P^ exp(—iS'^ / a) . In terms of these complex coordinates, the tensors that define the Kahler 
geometry, Eqs. (15), take the standard form which is characteristic of a fiat-space [10], 





^ ial] 










^ab — 




gab = 




, J\ = 1 






1 —ial , 




J 




V 



(16) 



This shows that the simplest geometrical formulation of the space of probabilities in motion 
has a natural set of fundamental variables, ip^ and -0*. If we set the constant a equal to h, 
these fundamental variables are precisely the wave functions of quantum mechanics. 

This is a remarkable result because we have not introduced any assumptions that concern 
quantum mechanics, only geometrical arguments. 



VI. STATISTICAL DISTANCE IN THE KAHLER SPACE 



We now want to introduce a new expression for statistical distance which will be valid 
in the Kahler space. In going from the n-dimensional space of probabilities to the full 
2n-dimensional phase space of the and 5", the metric has been extended, and this should 
be taken in consideration when carrying out the generalization of the statistical distance. 

Consider two points ipA and ipB representing states in this space which are joined by a 
curve ip^{t), < t < 1. Combining Eq. (3) with Eq. (16), the correct generalization of the 
expression for the distance I will be given by 



(17) 



where we have introduced the notation |('0,'0)| = \/^i0^- 

The statistical distance in the Kahler space is defined as the shortest distance computed 
with Eq. (17). Since the curve P{t) is assumed to lie in the probability space, then ^(t) 
must satisfy the condition 

i = EntWit), (18) 

i 

that is, the curve must lie on the unit sphere in the {^/'(t), ^(t)} space. The shortest distance 
on the unit sphere is equal to the angle between the unit vectors ipA and ipB- This leads 
immediately to the expression for the statistical distance that appears at the end of section 
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Ill of Wootters' paper (no equation number) provided a is set to the standard value of |, 

d{lljA,^B) = V2^C0S-^\{ll^A,^B)\. (19) 

In this way we provide a very brief, geometrical derivation of Wootters' expression for 
the statistical distance in quantum mechanics, which he derived using a completely different 
argument based on distinguishabihty [2] . 



VII. GROUP OF LINEAR UNITARY TRANSFORMATIONS 

We now examine the transformations which are allowed by the theory As pointed out 
briefly in the discussion on observables, there are some basic conditions that must be sat- 
isfied; we list them and derive the group of transformations that is consistent with such 
conditions. 

The first requirement is that the transformations preserve the normalization of the prob- 
ability, Y.iP' = Ei^'i'' = 1- 

The second requirement is that the metric be form invariant under those transformations; 
i.e., that the hne element da'^ — 2aJ2j d't/j^di/j^ of the Kahler space is preserved by the 
transformations. 

Requiring normalization of the probability and metric invariance leads to the group of 
rotations on the 2?T,-dimensional sphere. Such rotations are linear with respect to and 
tjj^ . For an infinitesimal transformation, it follows that 

dH f. dH 

are linear in %p and ip, where H is the Hamiltonian that generates the motion. Then H must 
be of the form 

H = E{t) + [Mjki^'il^" + Njki^^i/j'' + Njk^P^^''] (21) 

where E{t) is a arbitrary function of time, M is Hermitian, and is symmetric. 

The third and final requirement is that we only consider rotations on the 2n- dimensional 
sphere that are compatible with the equations of motion. As pointed out in the discussion on 
observables, conservation of probability requires that the ensemble Hamiltonian be invariant 
(up to an additive constant) under — >■ + x, since to first-order [9] 

= eY.P^ = eY,^^H{P,S + e)- H{P, S). (22) 
j j 



This condition, when written in terms of complex coordinates, is equivalent to invariance of 
the Hamiltonian under t/j — >■ ■0e'^. Using the notation Q :— Y.Njk'^^'^^ , Eq. (21) leads to 
the equality 

(23) 



ge^*^ + Qe 







which must be valid for all x- Differentiating with respect to x gives the additional equality 



2i 



-2ix 



0. 



(24) 



Combining these two expressions leads to Q = 0. Since this must hold for all ip, it follows 
that Njh, = 0, i.e., the ensemble Hamiltonian has the Hermitian form 



(25) 



as desired. 

This shows that the group of transformations of the theory is precisely the group of linear 
unitary transformations. Note that all that is used here, in moving from all rotations on 
the 2n-dimensional sphere to the subset of unitary transformations, is (i) the conservation 
of probability and (ii) that the equations of motion follow from an action principle. 

VIII. HILBERT SPACE FORMULATION 



There is a standard construction that associates a complex Hilbert space with any Kahler 
space. Given two complex vectors 0* and define the Dirac product by [11] 

^ Vi (ch\dA ■ \a + in] ■ 



= -E 



1 

1 



+ i 



\ 





(26) 



This suggests that the Hilbert space structure of quantum mechanics is perhaps not as 
fundamental as its geometrical structure. 
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IX. CONCLUDING REMARKS 



We have shown that the Hilbert space formulation of discrete quantum theory emerges 
from the geometry of probabihties in motion. The basic elements that go into this geo- 
metrical reconstruction of discrete quantum mechanics are the natural metric on the space 
of probabilities (information geometry), the description of dynamics using a Hamiltonian 
formalism (symplectic geometry), and requirements of consistency (Kahlcr geometry). 

We summarize some of the remarkable features of this construction. The wave functions 
of quantum mechanics, — -\/P^e*'^*, appear as the natural complex coordinates of the 
Kahler space that describes the geometry of probabilities in motion. The full group of unitary 
transformations is derived based on consistency requirements. And, finally, a Hilbert space 
may be associated with the Kahler space of the theory, which leads to the standard version 
of quantum theory. 

We have shown that we can derive Wootters' statistical distance for quantum mechanics, 
Eq. (17), in a purely geometrical way. We have commented in this brief paper on the 
connection between our work and that of Wootters; it would also be interesting to investigate 
the connection of our work to papers of Mehrafarin [3] and Goyal [4, 5] which also take an 
information-geometrical approach. One of the main differences seems to come from the 
emphasis that we place on finding a geometrical description for the space of probabilities 
in motion. The use of an action principle to describe the dynamics of the probabilities 

introduces geometrical structure that is quite powerful: we immediately get a doubling 
of the dimensionality of the space (i.e., {-P*} — > {P',5'*}) and, in addition, we end up 
with a complex structure and unitary transformations, all of them ingredients that are 
essential for quantum mechanics. In our formalism, these are mainly the result of consistency 
requirements which wc impose to ensure the peaceful coexistence of information geometry 
and symplectic geometry. 

It appears then that discrete quantum mechanics stands at the intersection of information 
geometry, symplectic geometry, and Kahler geometry. 
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Appendix A: The condition A = 

To fix the matrix A that appears in Eqs. (14) we need to supplement the Kahler condi- 
tions, Eqs. (10-12), with an additional condition. We make use of the fact that motion in the 
probability space (for example, the trajectory in phase space generated by an observable) 
preserves the normalization J2i — ^- To look more closely at this constraint, we introduce 
the new set of 2n real coordinates 

x' = V2aP^ cos [S'/aj , y' = V2aP^ sin (S'/a) . (Al) 

One can check that this coordinate transformation is also a canonical transformation, 
{x^,y^} = so the symplectic form remains invariant. 

The constraint Z)i-P* = 1 takes the form Y,i{{x^y + (l/*)^} = 2a in the new coordinates. 
Therefore, motion is restricted to a 2n-dimensional sphere in the space of the x^ and y*. 
We require that the metric be invariant under rotations (equivalently, invariant under the 
motions generated by observables) ; i.e., that it be spherically symmetric. In the coordinates 
x^ and y\ a spherically symmetric line element has the general form 

dp' = m E {(dxr + {dyr} + 9{r) |E(^'^^' + y'dy')^ (A2) 

where / and g are arbitrary functions of r = \fj2i{{x^y + (l/*)^}- In terms of the original 
coordinates and S'', the spherically symmetric metric takes the form 

dp' = /(En E [w^idPf + ^(dSr^ + g{j:P^) |aE^^^} (A3) 

The important point here is that the metric dp^ does not have any mixed terms proportional 
to dP'^dS^ , and this means that we must set A = to satisfy our requirement of metric 
invariance. 
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